86 research outputs found

    Interaction of perceptual grouping and crossmodal temporal capture in tactile apparent-motion

    Previous studies have shown that in tasks requiring participants to report the direction of apparent motion, task-irrelevant mono-beeps can "capture" visual motion perception when the beeps occur temporally close to the visual stimuli. However, the contributions of the relative timing of multimodal events and of the event structure, which modulates uni- and/or crossmodal perceptual grouping, remain unclear. To examine this question and extend the investigation to the tactile modality, the current experiments presented tactile two-tap apparent-motion streams, with an SOA of 400 ms between successive left- and right-hand middle-finger taps, accompanied by task-irrelevant, non-spatial auditory stimuli. The streams were presented for 90 seconds, and participants' task was to continuously report the perceived (left- or rightward) direction of tactile motion. In Experiment 1, each tactile stimulus was paired with an auditory beep, though odd-numbered taps were paired with an asynchronous beep, with audiotactile SOAs ranging from -75 ms to 75 ms. The perceived direction of tactile motion varied systematically with audiotactile SOA, indicative of a temporal-capture effect. In Experiment 2, two audiotactile SOAs, one short (75 ms) and one long (325 ms), were compared. The long-SOA condition preserved the crossmodal event structure (so the temporal-capture dynamics should have been similar to those in Experiment 1), but both beeps now occurred temporally close to the taps on one side (the even-numbered taps). The two SOAs were found to produce opposite modulations of apparent motion, indicative of an influence of crossmodal grouping. In Experiment 3, only odd-numbered, but not even-numbered, taps were paired with auditory beeps. This abolished the temporal-capture effect; instead, a dominant percept of apparent motion from the audiotactile side to the tactile-only side was observed, independently of the SOA variation. These findings suggest that asymmetric crossmodal grouping leads to an attentional modulation of apparent motion, which inhibits crossmodal temporal-capture effects.

    Enhanced Visual Temporal Resolution in Autism Spectrum Disorders

    Cognitive functions that rely on accurate sequencing of events, such as action planning and execution, verbal and nonverbal communication, and social interaction, depend on well-tuned coding of temporal event structure. Visual temporal event-structure coding was tested in 17 high-functioning adolescents and adults with autism spectrum disorder (ASD) and in mental- and chronological-age-matched typically developing (TD) individuals using a perceptual simultaneity paradigm. Visual simultaneity thresholds were lower in individuals with ASD than in TD individuals, suggesting that autism may be characterised by increased parsing of temporal event structure, with a decreased capability for integration over time. Lower perceptual simultaneity thresholds in ASD were also related to increased developmental communication difficulties. These results are linked to detail-focussed and local processing biases.
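
    Simultaneity thresholds of this kind are commonly obtained by fitting a psychometric function to judgments collected over a range of stimulus onset asynchronies (SOAs). Below is a minimal sketch of such a fit, using a cumulative Gaussian and invented response proportions purely for illustration; the study's actual analysis may differ.

```python
# Minimal sketch: estimating a visual simultaneity threshold by fitting a
# cumulative Gaussian to the proportion of "not simultaneous" responses.
# SOAs and proportions below are invented illustration values, not study data.
import numpy as np
from scipy.optimize import curve_fit
from scipy.stats import norm

soa_ms = np.array([0, 17, 33, 50, 67, 83, 100])                      # stimulus onset asynchronies
p_successive = np.array([0.05, 0.10, 0.30, 0.55, 0.80, 0.92, 0.97])  # proportion judged "not simultaneous"

def cum_gauss(soa, mu, sigma):
    """Cumulative Gaussian psychometric function."""
    return norm.cdf(soa, loc=mu, scale=sigma)

(mu, sigma), _ = curve_fit(cum_gauss, soa_ms, p_successive, p0=[50.0, 20.0])

# The 50% point (mu) serves as the simultaneity threshold: a lower value
# corresponds to finer temporal parsing, as reported here for the ASD group.
print(f"simultaneity threshold ~ {mu:.1f} ms, slope parameter ~ {sigma:.1f} ms")
```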

    Audio-Visual Speech Timing Sensitivity Is Enhanced in Cluttered Conditions

    Events encoded in separate sensory modalities, such as audition and vision, can seem to be synchronous across a relatively broad range of physical timing differences. This may suggest that the precision of audio-visual timing judgments is inherently poor. Here we show that this is not necessarily true. We contrast timing sensitivity for isolated streams of audio and visual speech, and for streams of audio and visual speech accompanied by additional, temporally offset, visual speech streams. We find that the precision with which synchronous streams of audio and visual speech are identified is enhanced by the presence of additional streams of asynchronous visual speech. Our data suggest that timing perception is shaped by selective grouping processes, which can result in enhanced precision in temporally cluttered environments. The imprecision suggested by previous studies might therefore be a consequence of examining isolated pairs of audio and visual events. We argue that when an isolated pair of cross-modal events is presented, the two events tend to group perceptually and, as a consequence, to seem synchronous. We have revealed greater precision by providing multiple visual signals, possibly allowing a single auditory speech stream to group selectively with the most synchronous visual candidate. The grouping processes we have identified might be important in daily life, such as when we attempt to follow a conversation in a crowded room.

    Optimal perceived timing: integrating sensory information with dynamically updated expectations

    The environment has a temporal structure, and knowing when a stimulus will appear translates into increased perceptual performance. Here we investigated how the human brain exploits temporal regularity in stimulus sequences for perception. We find that the timing of stimuli that occasionally deviate from a regularly paced sequence is perceptually distorted. Stimuli presented earlier than expected are perceptually delayed, whereas stimuli presented on time or later than expected are perceptually accelerated. This result suggests that the brain regularizes slightly deviant stimuli, with an asymmetry that leads to the perceptual acceleration of expected stimuli. We present a Bayesian model for the combination of dynamically updated expectations, in the form of the a priori probability of encountering future stimuli, with incoming sensory information. The asymmetries in the results are accounted for by the asymmetries in the distributions involved in the computational process.
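
    A minimal sketch of the kind of computation described, assuming Gaussian forms for both the dynamically updated expectation (the prior over onset time) and the sensory measurement (the likelihood). The Gaussian assumption is made here only for illustration; the paper's model uses its own, asymmetric distributions, which is what produces the asymmetric distortions reported.

```python
# Minimal sketch of Bayesian combination of a timing expectation with a
# sensory measurement. Gaussian prior and likelihood are an illustrative
# assumption, not the paper's exact model.
def perceived_onset(expected_t, prior_sd, measured_t, sensory_sd):
    """Posterior mean onset time under a Gaussian prior and Gaussian likelihood."""
    w_prior = 1.0 / prior_sd**2        # precision of the expectation
    w_sense = 1.0 / sensory_sd**2      # precision of the sensory measurement
    return (w_prior * expected_t + w_sense * measured_t) / (w_prior + w_sense)

# A stimulus arriving 60 ms earlier than the regular beat (expected at 0 ms)
# is pulled toward the expected time, i.e. perceptually delayed.
print(perceived_onset(expected_t=0.0, prior_sd=40.0, measured_t=-60.0, sensory_sd=30.0))
```

    In this Gaussian sketch, early stimuli are simply attracted toward the expected time; reproducing the reported acceleration of on-time and late stimuli requires the asymmetric, dynamically updated distributions of the full model.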

    Neural mechanisms of interstimulus interval-dependent responses in the primary auditory cortex of awake cats

    Background: Primary auditory cortex (AI) neurons show qualitatively distinct response features to successive acoustic signals depending on the inter-stimulus interval (ISI). Such ISI-dependent AI responses are believed to underlie, at least partially, categorical perception of click trains (elemental vs. fused quality) and of stop consonant-vowel syllables (e.g., the /da/-/ta/ continuum). Methods: Single-unit recordings were conducted on 116 AI neurons in awake cats. Rectangular clicks were presented either alone (single-click paradigm) or as trains with variable ISI (2–480 ms) (click-train paradigm). Response features of AI neurons were quantified as a function of ISI: one measure was related to the degree of stimulus locking (temporal modulation transfer function [tMTF]) and another was based on firing rate (rate modulation transfer function [rMTF]). An additional modeling study was performed to gain insight into the neurophysiological bases of the observed responses. Results: In the click-train paradigm, the majority of the AI neurons ("synchronization type"; n = 72) showed stimulus-locking responses at long ISIs. The shorter cutoff ISI for stimulus-locking responses was on average ~30 ms and was level tolerant, in accordance with the perceptual boundary of click trains and of consonant-vowel syllables. The tMTF shape of these neurons was either band-pass or low-pass. The single-click paradigm revealed, at maximum, four response periods in the following order: 1st excitation, 1st suppression, 2nd excitation, then 2nd suppression. The 1st excitation and 1st suppression were found exclusively in the synchronization type, implying that the temporal interplay between excitation and suppression underlies stimulus-locking responses. Among these neurons, those showing the 2nd suppression had band-pass tMTFs, whereas those with low-pass tMTFs never showed the 2nd suppression, implying that tMTF shape is mediated through the 2nd suppression. The recovery time course of excitability suggested the involvement of short-term plasticity. The observed phenomena were well captured by a single-cell model that incorporated AMPA, GABA-A, NMDA, and GABA-B receptors as well as short-term plasticity of thalamocortical synaptic connections. Conclusion: Overall, it was suggested that the ISI-dependent responses of the majority of AI neurons are configured through the temporal interplay of excitation and suppression (inhibition), along with short-term plasticity.
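
    As a rough illustration of the two response measures, the sketch below computes a stimulus-locking index (vector strength, one common basis for a tMTF) and a mean firing rate (the basis for an rMTF) for a click train at a given ISI. This is a generic formulation with invented spike times, not the authors' analysis pipeline.

```python
# Rough sketch of the two response measures for one ISI: vector strength as a
# stimulus-locking index (one common basis for a tMTF) and mean firing rate
# (the basis for an rMTF). Generic formulation; spike times are invented.
import numpy as np

def vector_strength(spike_times_ms, isi_ms):
    """Magnitude of the mean resultant vector of spike phases re. the click period."""
    phases = 2.0 * np.pi * (np.asarray(spike_times_ms) % isi_ms) / isi_ms
    return np.abs(np.mean(np.exp(1j * phases)))

def firing_rate(spike_times_ms, train_duration_ms):
    """Mean firing rate in spikes per second over the click-train window."""
    return 1000.0 * len(spike_times_ms) / train_duration_ms

spikes = [12.0, 43.0, 71.5, 102.0, 131.0, 162.5]   # hypothetical spike times (ms)
print(vector_strength(spikes, isi_ms=30.0), firing_rate(spikes, train_duration_ms=480.0))
```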

    The Natural Statistics of Audiovisual Speech

    Humans, like other animals, are exposed to a continuous stream of signals, which are dynamic, multimodal, extended, and time-varying in nature. This complex input space must be transduced and sampled by our sensory systems and transmitted to the brain, where it can guide the selection of appropriate actions. To simplify this process, it has been suggested that the brain exploits statistical regularities in the stimulus space. Tests of this idea have largely been confined to unimodal signals and natural scenes. One important class of multisensory signals for which a quantitative input-space characterization is unavailable is human speech. We do not understand what signals our brain has to actively piece together from an audiovisual speech stream to arrive at a percept, versus what is already embedded in the signal structure of the stream itself. In essence, we do not have a clear understanding of the natural statistics of audiovisual speech. In the present study, we identified the following major statistical features of audiovisual speech. First, we observed robust correlations and close temporal correspondence between the area of the mouth opening and the acoustic envelope. Second, we found the strongest correlation between the area of the mouth opening and vocal-tract resonances. Third, we observed that both the area of the mouth opening and the voice envelope are temporally modulated in the 2–7 Hz frequency range. Finally, we show that the timing of mouth movements relative to the onset of the voice is consistently between 100 and 300 ms. We interpret these data in the context of recent neural theories of speech, which suggest that speech communication is a reciprocally coupled, multisensory event whereby the outputs of the signaler are matched to the neural processes of the receiver.
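
    A minimal sketch of the kind of measurements involved: correlating a mouth-area time series with the acoustic amplitude envelope and estimating how much of the envelope's power falls in the 2–7 Hz band. The signals and sampling rate below are synthetic placeholders, not the authors' data or analysis code.

```python
# Minimal sketch of two of the measurements described above: the correlation
# between a mouth-area time series and the acoustic envelope, and the share of
# envelope power in the 2-7 Hz band. Signals and sampling rate are placeholders.
import numpy as np
from scipy.signal import hilbert, welch

fs = 100.0                                   # assumed common sampling rate (Hz)
t = np.arange(0, 10, 1 / fs)
mouth_area = 1 + 0.5 * np.sin(2 * np.pi * 4 * t)   # synthetic mouth-opening area (4 Hz modulation)
audio = (1 + 0.5 * np.sin(2 * np.pi * 4 * (t - 0.02))) * np.sin(2 * np.pi * 20 * t)  # synthetic audio
audio_env = np.abs(hilbert(audio))                 # acoustic amplitude envelope

# Zero-lag Pearson correlation between the two time series.
r = np.corrcoef(mouth_area, audio_env)[0, 1]

# Fraction of envelope power falling in the 2-7 Hz modulation band.
freqs, psd = welch(audio_env - audio_env.mean(), fs=fs, nperseg=512)
band = (freqs >= 2) & (freqs <= 7)
band_fraction = psd[band].sum() / psd.sum()

print(f"mouth-envelope correlation r = {r:.2f}, 2-7 Hz power fraction = {band_fraction:.2f}")
```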

    Monkeys and Humans Share a Common Computation for Face/Voice Integration

    Speech production involves the movement of the mouth and other regions of the face, resulting in visual motion cues. These visual cues enhance the intelligibility and detection of auditory speech. As such, face-to-face speech is fundamentally a multisensory phenomenon. If speech is fundamentally multisensory, this should be reflected in the evolution of vocal communication: similar behavioral effects should be observed in other primates. Old World monkeys share vocal-production biomechanics with humans and communicate face-to-face with vocalizations. It is unknown, however, whether they, too, combine faces and voices to enhance their perception of vocalizations. We show that they do: monkeys combine faces and voices in noisy environments to enhance their detection of vocalizations. Their behavior parallels that of humans performing an identical task. We explored what common computational mechanism(s) could explain the pattern of results we observed across species. Standard explanations or models, such as the principle of inverse effectiveness and a “race” model, failed to account for the behavior patterns. Conversely, a “superposition model”, positing the linear summation of activity patterns in response to the visual and auditory components of vocalizations, served as a straightforward but powerful explanatory mechanism for the observed behaviors in both species. As such, it represents a putative homologous mechanism for integrating faces and voices across primates.
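
    As a loose illustration of the model classes contrasted above, the sketch below simulates detection times under a race model (the faster of two independent unisensory detectors determines the response) and under a superposition-style account (evidence from the two channels sums before a single threshold is reached). All parameters are arbitrary illustration values, not the authors' implementation.

```python
# Loose illustration of the two model classes contrasted above. A race model
# takes the faster of two independent unisensory detection times; a
# superposition-style model sums the two channels' evidence before a single
# threshold is crossed. All parameters are arbitrary illustration values.
import numpy as np

rng = np.random.default_rng(0)
n_trials = 10_000
threshold = 100.0                      # arbitrary evidence criterion

# Per-trial accumulation rates (evidence units per ms) for each channel.
rate_aud = rng.normal(1.0, 0.3, n_trials).clip(0.1)
rate_vis = rng.normal(0.8, 0.3, n_trials).clip(0.1)

rt_aud = threshold / rate_aud          # unisensory auditory detection times
rt_vis = threshold / rate_vis          # unisensory visual detection times

rt_race = np.minimum(rt_aud, rt_vis)              # race: faster channel wins
rt_super = threshold / (rate_aud + rate_vis)      # superposition: summed evidence

print(f"mean RT  auditory {rt_aud.mean():.0f} ms, visual {rt_vis.mean():.0f} ms")
print(f"mean RT  race {rt_race.mean():.0f} ms, superposition {rt_super.mean():.0f} ms")
```

    In this toy simulation, the summed-evidence account yields a larger multisensory speedup than the race account, the qualitative pattern that a superposition model is invoked to explain.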

    Audiotactile interactions in temporal perception
